Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 45215 |
| Missing cells | 8 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 4 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 5.9 MiB |
| Average record size in memory | 136.0 B |
Variable types
| Numeric | 7 |
|---|---|
| Categorical | 6 |
| Boolean | 4 |
| Dataset has 4 (< 0.1%) duplicate rows | Duplicates |
education is highly imbalanced (51.3%) | Imbalance |
default is highly imbalanced (87.0%) | Imbalance |
poutcome is highly imbalanced (63.7%) | Imbalance |
balance is highly skewed (γ1 = 57.21509551) | Skewed |
previous is highly skewed (γ1 = 41.84300806) | Skewed |
balance has 3514 (7.8%) zeros | Zeros |
previous has 36957 (81.7%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-21 18:06:53.869313 |
|---|---|
| Analysis finished | 2024-05-21 18:07:43.627191 |
| Duration | 49.76 seconds |
| Software version | ydata-profiling v4.8.3 |
| Download configuration | config.json |
age
Real number (ℝ)
| Distinct | 85 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.004711 |
| Minimum | 18 |
|---|---|
| Maximum | 776 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 27 |
| Q1 | 33 |
| median | 39 |
| Q3 | 48 |
| 95-th percentile | 59 |
| Maximum | 776 |
| Range | 758 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 12.036647 |
|---|---|
| Coefficient of variation (CV) | 0.29354304 |
| Kurtosis | 474.51164 |
| Mean | 41.004711 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 10.248147 |
| Sum | 1854028 |
| Variance | 144.88088 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 32 | 2084 | 4.6% |
| 31 | 1995 | 4.4% |
| 33 | 1972 | 4.4% |
| 34 | 1930 | 4.3% |
| 35 | 1894 | 4.2% |
| 36 | 1806 | 4.0% |
| 30 | 1757 | 3.9% |
| 37 | 1696 | 3.8% |
| 39 | 1486 | 3.3% |
| 38 | 1466 | 3.2% |
| Other values (75) | 27129 |
| Value | Count | Frequency (%) |
| 18 | 12 | < 0.1% |
| 19 | 35 | 0.1% |
| 20 | 50 | 0.1% |
| 21 | 79 | 0.2% |
| 22 | 129 | 0.3% |
| 23 | 201 | 0.4% |
| 24 | 302 | 0.7% |
| 25 | 527 | |
| 26 | 805 | |
| 27 | 909 |
| Value | Count | Frequency (%) |
| 776 | 1 | |
| 530 | 1 | |
| 490 | 1 | |
| 466 | 1 | |
| 399 | 1 | |
| 332 | 1 | |
| 311 | 1 | |
| 123 | 1 | |
| 95 | 2 | |
| 94 | 1 |
job
Categorical
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 353.4 KiB |
| blue-collar | |
|---|---|
| management | |
| technician | |
| admin. | |
| services | |
| Other values (13) |
Length
| Max length | 14 |
|---|---|
| Median length | 13 |
| Mean length | 9.486077 |
| Min length | 6 |
Characters and Unicode
| Total characters | 428894 |
|---|---|
| Distinct characters | 32 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | management |
|---|---|
| 2nd row | technician |
| 3rd row | entrepreneur |
| 4th row | blue-collar |
| 5th row | unknown |
Common Values
| Value | Count | Frequency (%) |
| blue-collar | 9731 | |
| management | 9455 | |
| technician | 7599 | |
| admin. | 5168 | |
| services | 4153 | |
| retired | 2263 | 5.0% |
| self-employed | 1578 | 3.5% |
| entrepreneur | 1487 | 3.3% |
| unemployed | 1303 | 2.9% |
| housemaid | 1240 | 2.7% |
| Other values (8) | 1236 | 2.7% |
Length
| Value | Count | Frequency (%) |
| blue-collar | 9731 | |
| management | 9459 | |
| technician | 7599 | |
| admin | 5168 | |
| services | 4154 | |
| retired | 2264 | 5.0% |
| self-employed | 1579 | 3.5% |
| entrepreneur | 1487 | 3.3% |
| unemployed | 1303 | 2.9% |
| housemaid | 1240 | 2.7% |
| Other values (3) | 1229 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 64552 | |
| n | 45362 | |
| a | 42658 | |
| l | 33654 | 7.8% |
| c | 29083 | 6.8% |
| m | 28205 | 6.6% |
| i | 28033 | 6.5% |
| r | 22876 | 5.3% |
| t | 22689 | 5.3% |
| u | 14987 | 3.5% |
| Other values (22) | 96795 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 412391 | |
| Dash Punctuation | 11310 | 2.6% |
| Other Punctuation | 5168 | 1.2% |
| Uppercase Letter | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 64552 | |
| n | 45362 | |
| a | 42658 | |
| l | 33654 | |
| c | 29083 | 7.1% |
| m | 28205 | 6.8% |
| i | 28033 | 6.8% |
| r | 22876 | 5.5% |
| t | 22689 | 5.5% |
| u | 14987 | 3.6% |
| Other values (12) | 80292 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 6 | |
| A | 4 | |
| N | 4 | |
| E | 4 | |
| G | 2 | 8.0% |
| T | 2 | 8.0% |
| S | 2 | 8.0% |
| R | 1 | 4.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11310 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5168 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 412416 | |
| Common | 16478 | 3.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 64552 | |
| n | 45362 | |
| a | 42658 | |
| l | 33654 | |
| c | 29083 | 7.1% |
| m | 28205 | 6.8% |
| i | 28033 | 6.8% |
| r | 22876 | 5.5% |
| t | 22689 | 5.5% |
| u | 14987 | 3.6% |
| Other values (20) | 80317 |
Common
| Value | Count | Frequency (%) |
| - | 11310 | |
| . | 5168 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 428894 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 64552 | |
| n | 45362 | |
| a | 42658 | |
| l | 33654 | 7.8% |
| c | 29083 | 6.8% |
| m | 28205 | 6.6% |
| i | 28033 | 6.5% |
| r | 22876 | 5.3% |
| t | 22689 | 5.3% |
| u | 14987 | 3.5% |
| Other values (22) | 96795 |
marital
Categorical
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 353.4 KiB |
| married | |
|---|---|
| single | |
| divorced | |
| div. | 7 |
| Single | 4 |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.8316672 |
| Min length | 4 |
Characters and Unicode
| Total characters | 308887 |
|---|---|
| Distinct characters | 22 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | married |
|---|---|
| 2nd row | single |
| 3rd row | married |
| 4th row | married |
| 5th row | single |
Common Values
| Value | Count | Frequency (%) |
| married | 27215 | |
| single | 12787 | |
| divorced | 5198 | 11.5% |
| div. | 7 | < 0.1% |
| Single | 4 | < 0.1% |
| DIVORCED | 3 | < 0.1% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| married | 27215 | |
| single | 12791 | |
| divorced | 5201 | 11.5% |
| div | 7 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 59628 | |
| i | 45211 | |
| e | 45204 | |
| d | 37618 | |
| m | 27215 | |
| a | 27215 | |
| n | 12791 | 4.1% |
| g | 12791 | 4.1% |
| l | 12791 | 4.1% |
| s | 12787 | 4.1% |
| Other values (12) | 15636 | 5.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 308852 | |
| Uppercase Letter | 28 | < 0.1% |
| Other Punctuation | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 59628 | |
| i | 45211 | |
| e | 45204 | |
| d | 37618 | |
| m | 27215 | |
| a | 27215 | |
| n | 12791 | 4.1% |
| g | 12791 | 4.1% |
| l | 12791 | 4.1% |
| s | 12787 | 4.1% |
| Other values (3) | 15601 | 5.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 6 | |
| S | 4 | |
| I | 3 | |
| V | 3 | |
| O | 3 | |
| R | 3 | |
| C | 3 | |
| E | 3 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 308880 | |
| Common | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 59628 | |
| i | 45211 | |
| e | 45204 | |
| d | 37618 | |
| m | 27215 | |
| a | 27215 | |
| n | 12791 | 4.1% |
| g | 12791 | 4.1% |
| l | 12791 | 4.1% |
| s | 12787 | 4.1% |
| Other values (11) | 15629 | 5.1% |
Common
| Value | Count | Frequency (%) |
| . | 7 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 308887 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 59628 | |
| i | 45211 | |
| e | 45204 | |
| d | 37618 | |
| m | 27215 | |
| a | 27215 | |
| n | 12791 | 4.1% |
| g | 12791 | 4.1% |
| l | 12791 | 4.1% |
| s | 12787 | 4.1% |
| Other values (12) | 15636 | 5.1% |
education
Categorical
IMBALANCE 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 353.4 KiB |
| secondary | |
|---|---|
| tertiary | |
| primary | |
| unknown | 1855 |
| SECONDARY | 3 |
| Other values (5) | 8 |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 8.3201884 |
| Min length | 3 |
Characters and Unicode
| Total characters | 376189 |
|---|---|
| Distinct characters | 30 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | tertiary |
|---|---|
| 2nd row | secondary |
| 3rd row | secondary |
| 4th row | unknown |
| 5th row | unknown |
Common Values
| Value | Count | Frequency (%) |
| secondary | 23197 | |
| tertiary | 13302 | |
| primary | 6849 | 15.1% |
| unknown | 1855 | 4.1% |
| SECONDARY | 3 | < 0.1% |
| Primary | 2 | < 0.1% |
| sec. | 2 | < 0.1% |
| UNK | 2 | < 0.1% |
| Secondary | 1 | < 0.1% |
| Tertiary | 1 | < 0.1% |
| (Missing) | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| secondary | 23201 | |
| tertiary | 13303 | |
| primary | 6851 | 15.2% |
| unknown | 1855 | 4.1% |
| sec | 2 | < 0.1% |
| unk | 2 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 63506 | |
| a | 43352 | |
| y | 43352 | |
| e | 36503 | |
| n | 28763 | |
| t | 26605 | |
| o | 25053 | 6.7% |
| c | 23200 | 6.2% |
| s | 23199 | 6.2% |
| d | 23198 | 6.2% |
| Other values (20) | 39458 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 376150 | |
| Uppercase Letter | 37 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 63506 | |
| a | 43352 | |
| y | 43352 | |
| e | 36503 | |
| n | 28763 | |
| t | 26605 | |
| o | 25053 | 6.7% |
| c | 23200 | 6.2% |
| s | 23199 | 6.2% |
| d | 23198 | 6.2% |
| Other values (6) | 39419 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 5 | |
| S | 4 | |
| E | 3 | |
| C | 3 | |
| O | 3 | |
| D | 3 | |
| A | 3 | |
| R | 3 | |
| Y | 3 | |
| P | 2 | 5.4% |
| Other values (3) | 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 376187 | |
| Common | 2 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 63506 | |
| a | 43352 | |
| y | 43352 | |
| e | 36503 | |
| n | 28763 | |
| t | 26605 | |
| o | 25053 | 6.7% |
| c | 23200 | 6.2% |
| s | 23199 | 6.2% |
| d | 23198 | 6.2% |
| Other values (19) | 39456 |
Common
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 376189 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 63506 | |
| a | 43352 | |
| y | 43352 | |
| e | 36503 | |
| n | 28763 | |
| t | 26605 | |
| o | 25053 | 6.7% |
| c | 23200 | 6.2% |
| s | 23199 | 6.2% |
| d | 23198 | 6.2% |
| Other values (20) | 39458 |
default
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.3 KiB |
| False | |
|---|---|
| True | 816 |
| Value | Count | Frequency (%) |
| False | 44399 | |
| True | 816 | 1.8% |
balance
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 7168 |
|---|---|
| Distinct (%) | 15.9% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1374.1599 |
| Minimum | -8019 |
|---|---|
| Maximum | 527532 |
| Zeros | 3514 |
| Zeros (%) | 7.8% |
| Negative | 3767 |
| Negative (%) | 8.3% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | -8019 |
|---|---|
| 5-th percentile | -172 |
| Q1 | 72 |
| median | 448 |
| Q3 | 1428 |
| 95-th percentile | 5769 |
| Maximum | 527532 |
| Range | 535551 |
| Interquartile range (IQR) | 1356 |
Descriptive statistics
| Standard deviation | 3924.2555 |
|---|---|
| Coefficient of variation (CV) | 2.8557489 |
| Kurtosis | 7197.9493 |
| Mean | 1374.1599 |
| Median Absolute Deviation (MAD) | 448 |
| Skewness | 57.215096 |
| Sum | 62129890 |
| Variance | 15399781 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 3514 | 7.8% |
| 1 | 195 | 0.4% |
| 2 | 156 | 0.3% |
| 4 | 139 | 0.3% |
| 3 | 134 | 0.3% |
| 5 | 113 | 0.2% |
| 6 | 88 | 0.2% |
| 8 | 81 | 0.2% |
| 23 | 75 | 0.2% |
| 10 | 69 | 0.2% |
| Other values (7158) | 40649 |
| Value | Count | Frequency (%) |
| -8019 | 1 | |
| -6847 | 1 | |
| -4057 | 1 | |
| -3372 | 1 | |
| -3313 | 1 | |
| -3058 | 1 | |
| -2827 | 1 | |
| -2712 | 1 | |
| -2604 | 1 | |
| -2282 | 1 |
| Value | Count | Frequency (%) |
| 527532 | 1 | |
| 102127 | 1 | |
| 98417 | 1 | |
| 81204 | 2 | |
| 71188 | 1 | |
| 66721 | 1 | |
| 66653 | 1 | |
| 64343 | 1 | |
| 59649 | 1 | |
| 58932 | 1 |
housing
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.3 KiB |
| True | |
|---|---|
| False |
| Value | Count | Frequency (%) |
| True | 25132 | |
| False | 20083 |
loan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.3 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 37969 | |
| True | 7246 | 16.0% |
contact
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 353.4 KiB |
| cellular | |
|---|---|
| unknown | |
| telephone | 2903 |
| phone | 3 |
| mobile | 3 |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.775893 |
| Min length | 5 |
Characters and Unicode
| Total characters | 351587 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | unknown |
|---|---|
| 2nd row | unknown |
| 3rd row | unknown |
| 4th row | unknown |
| 5th row | unknown |
Common Values
| Value | Count | Frequency (%) |
| cellular | 29285 | |
| unknown | 13021 | |
| telephone | 2903 | 6.4% |
| phone | 3 | < 0.1% |
| mobile | 3 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cellular | 29285 | |
| unknown | 13021 | |
| telephone | 2903 | 6.4% |
| phone | 3 | < 0.1% |
| mobile | 3 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 90761 | |
| u | 42306 | |
| n | 41969 | |
| e | 38000 | |
| c | 29285 | 8.3% |
| a | 29285 | 8.3% |
| r | 29285 | 8.3% |
| o | 15930 | 4.5% |
| k | 13021 | 3.7% |
| w | 13021 | 3.7% |
| Other values (6) | 8724 | 2.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 351587 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 90761 | |
| u | 42306 | |
| n | 41969 | |
| e | 38000 | |
| c | 29285 | 8.3% |
| a | 29285 | 8.3% |
| r | 29285 | 8.3% |
| o | 15930 | 4.5% |
| k | 13021 | 3.7% |
| w | 13021 | 3.7% |
| Other values (6) | 8724 | 2.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 351587 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 90761 | |
| u | 42306 | |
| n | 41969 | |
| e | 38000 | |
| c | 29285 | 8.3% |
| a | 29285 | 8.3% |
| r | 29285 | 8.3% |
| o | 15930 | 4.5% |
| k | 13021 | 3.7% |
| w | 13021 | 3.7% |
| Other values (6) | 8724 | 2.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 351587 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 90761 | |
| u | 42306 | |
| n | 41969 | |
| e | 38000 | |
| c | 29285 | 8.3% |
| a | 29285 | 8.3% |
| r | 29285 | 8.3% |
| o | 15930 | 4.5% |
| k | 13021 | 3.7% |
| w | 13021 | 3.7% |
| Other values (6) | 8724 | 2.5% |
day
Real number (ℝ)
| Distinct | 31 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.805839 |
| Minimum | 1 |
|---|---|
| Maximum | 31 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 8 |
| median | 16 |
| Q3 | 21 |
| 95-th percentile | 29 |
| Maximum | 31 |
| Range | 30 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 8.322473 |
|---|---|
| Coefficient of variation (CV) | 0.52654422 |
| Kurtosis | -1.059853 |
| Mean | 15.805839 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.093156482 |
| Sum | 714661 |
| Variance | 69.263557 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 2752 | 6.1% |
| 18 | 2308 | 5.1% |
| 21 | 2026 | 4.5% |
| 17 | 1939 | 4.3% |
| 6 | 1932 | 4.3% |
| 5 | 1910 | 4.2% |
| 14 | 1848 | 4.1% |
| 8 | 1843 | 4.1% |
| 28 | 1830 | 4.0% |
| 7 | 1817 | 4.0% |
| Other values (21) | 25010 |
| Value | Count | Frequency (%) |
| 1 | 322 | 0.7% |
| 2 | 1294 | |
| 3 | 1079 | |
| 4 | 1445 | |
| 5 | 1910 | |
| 6 | 1932 | |
| 7 | 1817 | |
| 8 | 1843 | |
| 9 | 1561 | |
| 10 | 524 | 1.2% |
| Value | Count | Frequency (%) |
| 31 | 643 | 1.4% |
| 30 | 1566 | |
| 29 | 1745 | |
| 28 | 1830 | |
| 27 | 1121 | |
| 26 | 1035 | |
| 25 | 840 | |
| 24 | 447 | 1.0% |
| 23 | 939 | |
| 22 | 905 |
month
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 353.4 KiB |
| may | |
|---|---|
| jul | |
| aug | |
| jun | |
| nov | |
| Other values (7) |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 135645 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | may |
|---|---|
| 2nd row | may |
| 3rd row | may |
| 4th row | may |
| 5th row | may |
Common Values
| Value | Count | Frequency (%) |
| may | 13768 | |
| jul | 6895 | |
| aug | 6247 | |
| jun | 5342 | 11.8% |
| nov | 3971 | 8.8% |
| apr | 2932 | 6.5% |
| feb | 2649 | 5.9% |
| jan | 1403 | 3.1% |
| oct | 738 | 1.6% |
| sep | 579 | 1.3% |
| Other values (2) | 691 | 1.5% |
Length
| Value | Count | Frequency (%) |
| may | 13768 | |
| jul | 6895 | |
| aug | 6247 | |
| jun | 5342 | 11.8% |
| nov | 3971 | 8.8% |
| apr | 2932 | 6.5% |
| feb | 2649 | 5.9% |
| jan | 1403 | 3.1% |
| oct | 738 | 1.6% |
| sep | 579 | 1.3% |
| Other values (2) | 691 | 1.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 24827 | |
| u | 18484 | |
| m | 14245 | |
| y | 13768 | |
| j | 13640 | |
| n | 10716 | |
| l | 6895 | 5.1% |
| g | 6247 | 4.6% |
| o | 4709 | 3.5% |
| v | 3971 | 2.9% |
| Other values (9) | 18143 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 135645 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 24827 | |
| u | 18484 | |
| m | 14245 | |
| y | 13768 | |
| j | 13640 | |
| n | 10716 | |
| l | 6895 | 5.1% |
| g | 6247 | 4.6% |
| o | 4709 | 3.5% |
| v | 3971 | 2.9% |
| Other values (9) | 18143 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 135645 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 24827 | |
| u | 18484 | |
| m | 14245 | |
| y | 13768 | |
| j | 13640 | |
| n | 10716 | |
| l | 6895 | 5.1% |
| g | 6247 | 4.6% |
| o | 4709 | 3.5% |
| v | 3971 | 2.9% |
| Other values (9) | 18143 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 135645 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 24827 | |
| u | 18484 | |
| m | 14245 | |
| y | 13768 | |
| j | 13640 | |
| n | 10716 | |
| l | 6895 | 5.1% |
| g | 6247 | 4.6% |
| o | 4709 | 3.5% |
| v | 3971 | 2.9% |
| Other values (9) | 18143 |
duration
Real number (ℝ)
| Distinct | 1575 |
|---|---|
| Distinct (%) | 3.5% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 258.07436 |
| Minimum | -1389 |
|---|---|
| Maximum | 4918 |
| Zeros | 3 |
| Zeros (%) | < 0.1% |
| Negative | 2 |
| Negative (%) | < 0.1% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | -1389 |
|---|---|
| 5-th percentile | 35 |
| Q1 | 103 |
| median | 180 |
| Q3 | 319 |
| 95-th percentile | 750.35 |
| Maximum | 4918 |
| Range | 6307 |
| Interquartile range (IQR) | 216 |
Descriptive statistics
| Standard deviation | 257.60517 |
|---|---|
| Coefficient of variation (CV) | 0.99818199 |
| Kurtosis | 18.161989 |
| Mean | 258.07436 |
| Median Absolute Deviation (MAD) | 93 |
| Skewness | 3.134025 |
| Sum | 11668574 |
| Variance | 66360.426 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 124 | 188 | 0.4% |
| 90 | 184 | 0.4% |
| 89 | 177 | 0.4% |
| 114 | 175 | 0.4% |
| 104 | 175 | 0.4% |
| 122 | 175 | 0.4% |
| 136 | 174 | 0.4% |
| 112 | 174 | 0.4% |
| 139 | 174 | 0.4% |
| 121 | 173 | 0.4% |
| Other values (1565) | 43445 |
| Value | Count | Frequency (%) |
| -1389 | 1 | < 0.1% |
| -517 | 1 | < 0.1% |
| 0 | 3 | < 0.1% |
| 1 | 2 | < 0.1% |
| 2 | 3 | < 0.1% |
| 3 | 4 | < 0.1% |
| 4 | 15 | < 0.1% |
| 5 | 35 | |
| 6 | 45 | |
| 7 | 73 |
| Value | Count | Frequency (%) |
| 4918 | 1 | |
| 3881 | 1 | |
| 3785 | 1 | |
| 3422 | 1 | |
| 3366 | 1 | |
| 3322 | 1 | |
| 3284 | 1 | |
| 3253 | 1 | |
| 3183 | 1 | |
| 3102 | 1 |
campaign
Real number (ℝ)
| Distinct | 48 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.7637289 |
| Minimum | 1 |
|---|---|
| Maximum | 63 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 2 |
| Q3 | 3 |
| 95-th percentile | 8 |
| Maximum | 63 |
| Range | 62 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 3.0979102 |
|---|---|
| Coefficient of variation (CV) | 1.1209168 |
| Kurtosis | 39.252662 |
| Mean | 2.7637289 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 4.8988412 |
| Sum | 124962 |
| Variance | 9.5970477 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 17546 | |
| 2 | 12507 | |
| 3 | 5521 | 12.2% |
| 4 | 3522 | 7.8% |
| 5 | 1764 | 3.9% |
| 6 | 1291 | 2.9% |
| 7 | 735 | 1.6% |
| 8 | 540 | 1.2% |
| 9 | 327 | 0.7% |
| 10 | 266 | 0.6% |
| Other values (38) | 1196 | 2.6% |
| Value | Count | Frequency (%) |
| 1 | 17546 | |
| 2 | 12507 | |
| 3 | 5521 | 12.2% |
| 4 | 3522 | 7.8% |
| 5 | 1764 | 3.9% |
| 6 | 1291 | 2.9% |
| 7 | 735 | 1.6% |
| 8 | 540 | 1.2% |
| 9 | 327 | 0.7% |
| 10 | 266 | 0.6% |
| Value | Count | Frequency (%) |
| 63 | 1 | < 0.1% |
| 58 | 1 | < 0.1% |
| 55 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 50 | 2 | |
| 46 | 1 | < 0.1% |
| 44 | 1 | < 0.1% |
| 43 | 3 | |
| 41 | 2 | |
| 39 | 1 | < 0.1% |
pdays
Real number (ℝ)
| Distinct | 559 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.192485 |
| Minimum | -1 |
|---|---|
| Maximum | 871 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 36957 |
| Negative (%) | 81.7% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | -1 |
| Q1 | -1 |
| median | -1 |
| Q3 | -1 |
| 95-th percentile | 317 |
| Maximum | 871 |
| Range | 872 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 100.12062 |
|---|---|
| Coefficient of variation (CV) | 2.4910284 |
| Kurtosis | 6.9373411 |
| Mean | 40.192485 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.6159947 |
| Sum | 1817263 |
| Variance | 10024.139 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -1 | 36957 | |
| 182 | 167 | 0.4% |
| 92 | 147 | 0.3% |
| 91 | 126 | 0.3% |
| 183 | 126 | 0.3% |
| 181 | 117 | 0.3% |
| 370 | 99 | 0.2% |
| 184 | 85 | 0.2% |
| 364 | 77 | 0.2% |
| 95 | 74 | 0.2% |
| Other values (549) | 7239 | 16.0% |
| Value | Count | Frequency (%) |
| -1 | 36957 | |
| 1 | 15 | < 0.1% |
| 2 | 37 | 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 11 | < 0.1% |
| 6 | 10 | < 0.1% |
| 7 | 7 | < 0.1% |
| 8 | 25 | 0.1% |
| 9 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 871 | 1 | |
| 854 | 1 | |
| 850 | 1 | |
| 842 | 1 | |
| 838 | 1 | |
| 831 | 1 | |
| 828 | 1 | |
| 826 | 1 | |
| 808 | 1 | |
| 805 | 1 |
previous
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.58038262 |
| Minimum | 0 |
|---|---|
| Maximum | 275 |
| Zeros | 36957 |
| Zeros (%) | 81.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 353.4 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 3 |
| Maximum | 275 |
| Range | 275 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 2.3034378 |
|---|---|
| Coefficient of variation (CV) | 3.9688263 |
| Kurtosis | 4506.4832 |
| Mean | 0.58038262 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 41.843008 |
| Sum | 26242 |
| Variance | 5.3058256 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 36957 | |
| 1 | 2772 | 6.1% |
| 2 | 2106 | 4.7% |
| 3 | 1142 | 2.5% |
| 4 | 714 | 1.6% |
| 5 | 460 | 1.0% |
| 6 | 277 | 0.6% |
| 7 | 205 | 0.5% |
| 8 | 129 | 0.3% |
| 9 | 92 | 0.2% |
| Other values (31) | 361 | 0.8% |
| Value | Count | Frequency (%) |
| 0 | 36957 | |
| 1 | 2772 | 6.1% |
| 2 | 2106 | 4.7% |
| 3 | 1142 | 2.5% |
| 4 | 714 | 1.6% |
| 5 | 460 | 1.0% |
| 6 | 277 | 0.6% |
| 7 | 205 | 0.5% |
| 8 | 129 | 0.3% |
| 9 | 92 | 0.2% |
| Value | Count | Frequency (%) |
| 275 | 1 | |
| 58 | 1 | |
| 55 | 1 | |
| 51 | 1 | |
| 41 | 1 | |
| 40 | 1 | |
| 38 | 2 | |
| 37 | 2 | |
| 35 | 1 | |
| 32 | 1 |
poutcome
Categorical
IMBALANCE 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 353.4 KiB |
| unknown | |
|---|---|
| failure | |
| other | 1840 |
| success | 1509 |
| UNK | 4 |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 6.9182572 |
| Min length | 3 |
Characters and Unicode
| Total characters | 312809 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | unknown |
|---|---|
| 2nd row | unknown |
| 3rd row | unknown |
| 4th row | unknown |
| 5th row | unknown |
Common Values
| Value | Count | Frequency (%) |
| unknown | 36958 | |
| failure | 4902 | 10.8% |
| other | 1840 | 4.1% |
| success | 1509 | 3.3% |
| UNK | 4 | < 0.1% |
| Success | 2 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| unknown | 36958 | |
| failure | 4902 | 10.8% |
| other | 1840 | 4.1% |
| success | 1511 | 3.3% |
| unk | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 110874 | |
| u | 43371 | 13.9% |
| o | 38798 | 12.4% |
| k | 36958 | 11.8% |
| w | 36958 | 11.8% |
| e | 8253 | 2.6% |
| r | 6742 | 2.2% |
| i | 4902 | 1.6% |
| l | 4902 | 1.6% |
| a | 4902 | 1.6% |
| Other values (9) | 16149 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 312795 | |
| Uppercase Letter | 14 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 110874 | |
| u | 43371 | 13.9% |
| o | 38798 | 12.4% |
| k | 36958 | 11.8% |
| w | 36958 | 11.8% |
| e | 8253 | 2.6% |
| r | 6742 | 2.2% |
| i | 4902 | 1.6% |
| l | 4902 | 1.6% |
| a | 4902 | 1.6% |
| Other values (5) | 16135 | 5.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 4 | |
| N | 4 | |
| K | 4 | |
| S | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 312809 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 110874 | |
| u | 43371 | 13.9% |
| o | 38798 | 12.4% |
| k | 36958 | 11.8% |
| w | 36958 | 11.8% |
| e | 8253 | 2.6% |
| r | 6742 | 2.2% |
| i | 4902 | 1.6% |
| l | 4902 | 1.6% |
| a | 4902 | 1.6% |
| Other values (9) | 16149 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 312809 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 110874 | |
| u | 43371 | 13.9% |
| o | 38798 | 12.4% |
| k | 36958 | 11.8% |
| w | 36958 | 11.8% |
| e | 8253 | 2.6% |
| r | 6742 | 2.2% |
| i | 4902 | 1.6% |
| l | 4902 | 1.6% |
| a | 4902 | 1.6% |
| Other values (9) | 16149 | 5.2% |
y
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 44.3 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 39925 | |
| True | 5290 | 11.7% |
| age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 58 | management | married | tertiary | no | 2143.0 | yes | no | unknown | 5 | may | 261.0 | 1 | -1.0 | 0 | unknown | no |
| 1 | 44 | technician | single | secondary | no | 29.0 | yes | no | unknown | 5 | may | 151.0 | 1 | -1.0 | 0 | unknown | no |
| 2 | 33 | entrepreneur | married | secondary | no | 2.0 | yes | yes | unknown | 5 | may | 76.0 | 1 | -1.0 | 0 | unknown | no |
| 3 | 47 | blue-collar | married | unknown | no | 1506.0 | yes | no | unknown | 5 | may | 92.0 | 1 | -1.0 | 0 | unknown | no |
| 4 | 33 | unknown | single | unknown | no | 1.0 | no | no | unknown | 5 | may | 198.0 | 1 | -1.0 | 0 | unknown | no |
| 5 | 35 | management | married | tertiary | no | 231.0 | yes | no | unknown | 5 | may | 139.0 | 1 | -1.0 | 0 | unknown | no |
| 6 | 28 | Management | single | tertiary | no | 447.0 | yes | yes | unknown | 5 | may | 217.0 | 1 | -1.0 | 0 | unknown | no |
| 7 | 42 | entrepreneur | div. | tertiary | yes | 2.0 | yes | no | unknown | 5 | may | 380.0 | 1 | -1.0 | 0 | unknown | no |
| 8 | 58 | retired | married | primary | no | 121.0 | yes | no | unknown | 5 | may | 50.0 | 1 | -1.0 | 0 | unknown | no |
| 9 | 43 | technician | single | secondary | no | 593.0 | yes | No | unknown | 5 | may | 55.0 | 1 | -1.0 | 0 | unknown | no |
| age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 45205 | 53 | management | married | tertiary | no | 583.0 | no | no | cellular | 17 | nov | 226.0 | 1 | 184.0 | 4 | success | yes |
| 45206 | 34 | admin. | single | secondary | no | 557.0 | no | no | cellular | 17 | nov | 224.0 | 1 | -1.0 | 0 | unknown | yes |
| 45207 | 23 | student | single | tertiary | no | 113.0 | no | no | cellular | 17 | nov | 266.0 | 1 | -1.0 | 0 | unknown | yes |
| 45208 | 73 | retired | married | secondary | no | 2850.0 | no | no | cellular | 17 | nov | 300.0 | 1 | 40.0 | 8 | failure | yes |
| 45209 | 25 | technician | single | secondary | no | 505.0 | no | yes | cellular | 17 | nov | 386.0 | 2 | -1.0 | 0 | unknown | yes |
| 45210 | 51 | technician | married | tertiary | no | 825.0 | no | no | cellular | 17 | nov | 977.0 | 3 | -1.0 | 0 | unknown | yes |
| 45211 | 71 | retired | divorced | primary | no | 1729.0 | no | no | cellular | 17 | nov | 456.0 | 2 | -1.0 | 0 | unknown | yes |
| 45212 | 72 | retired | married | secondary | no | 5715.0 | no | no | cellular | 17 | nov | 1127.0 | 5 | 184.0 | 3 | success | yes |
| 45213 | 57 | blue-collar | married | secondary | no | 668.0 | no | no | telephone | 17 | nov | 508.0 | 4 | -1.0 | 0 | unknown | no |
| 45214 | 37 | entrepreneur | married | secondary | no | 2971.0 | no | no | cellular | 17 | nov | 361.0 | 2 | 188.0 | 11 | other | no |
Most frequently occurring
| age | job | marital | education | default | balance | housing | loan | contact | day | month | duration | campaign | pdays | previous | poutcome | y | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 29 | technician | single | tertiary | no | 18254.0 | no | no | cellular | 11 | may | 279.0 | 2 | -1.0 | 0 | unknown | no | 2 |
| 1 | 43 | blue-collar | married | secondary | yes | -7.0 | no | no | unknown | 8 | may | 70.0 | 1 | -1.0 | 0 | unknown | no | 2 |
| 2 | 52 | technician | divorced | secondary | no | 1005.0 | yes | no | cellular | 2 | jun | 195.0 | 1 | -1.0 | 0 | unknown | yes | 2 |
| 3 | 59 | management | married | tertiary | no | 138.0 | yes | yes | cellular | 16 | nov | 162.0 | 2 | 187.0 | 5 | failure | no | 2 |